CLAIMS 

1. (amended) A metadata production device, comprising- 

a content reproduction portion that reproduces and outputs content; 
5 an voice input portion; 

a voice recognition portion that recognizes voice signals that are 
input from the voice input portion; 

a metadata generation portion that converts information recognized 
hy the voice recognition portion into metadata; 
10 an identification information attaching portion that obtains 

identification information for identifying positions within the content from 
the content and attaches the identification information to the metadata; and 

a dictionary that is Umited in accordance with the content; 

whereby the generated metadata is associated with positions in the 
15 content; and 

the recognition is performed in association with the dictionary, when 
recognizing the voice signals input from the voice input portion with the 
voice recognition portion. 

20 2. (canceled) 

3. (amended) The metadata production device according to claim 1, 

wherein the voice signals are recognized by the voice recognition 
portion word by word in association with the dictionary. 

25 

4. The metadata production device according to claim 1 or 3, 

further comprising an information processing portion including a 
keyboard, wherein the metadata can be corrected through the information 
processing portion by input from the keyboard. 

30 

5. (amended) The metadata production device according to any of claims 1 
and 2 to 5, 

wherein time code information that is attached to the content is used 
as the identification information. 

35 

6. (amended) The metadata production device according to any of claims 1 
and 2 to 6, 



wherein content addresses, numbers or frame numbers attached to 
the content are used as the identification information. 

7. The metadata production device according to claim 1, 

5 wherein the content is still-picture content, and the addresses of the 

still-picture content are used as the identification information. 

8. The metadata production device according to claim 1, 

wherein the content reproduction portion is configured by a content 
10 database; 

wherein the voice input portion supplies to the voice recognition 
portion voice signals of entered keywords that have been converted into data 
with a clock signal that is synchronized with a synchronization signal 
supplied fi-om the content database; 

15 wherein the voice recognition portion is configured to recognize the 

keywords from the voice signal data that has been converted into data by the 
voice input portion; and 

wherein the metadata generation portion is configured as a file 
processing portion that produces a metadata file by using, as the 

20 identification information, a time code that indicates a time position of an 
image signal that is included in the content, and combining the keywords 
that are output from the voice recognition portion with that time code. 

9. The metadata production device according to claim 8, 

25 further comprising a recording portion that records the content that 

is supplied from the content database together with the metadata file as a 
content file. 

10. The metadata production device according to claim 9, 

30 further comprising a content information file processing portion that 

generates a control file controlling the relation between the metadata file 
and recording positions to be recorded by the content file; 

wherein the control file is recorded in the recording portion together 
with the content file and the metadata file. 

35 

11. The metadata production device according to claim 8, 

further comprising a dictionary database, wherein the voice 



20 



recognition portion can select a dictionary of a genre corresponding to the 
content from a plurality of genre -dependent dictionaries. 



12. The metadata production device according to claim 11, 
5 wherein keywords related to the content can be supplied to the voice 

recognition portion! and 

wherein the voice recognition portion is configiu-ed to recognize those 
keywords with higher priority. 

10 13. (amended) A method for producing metadata, comprising: 
voice "inputting information related to a given content while displaying the 
content on a monitor; subjecting the input voice signal to voice recognition 
with a voice recognition device using a dictionary that is Umited in 
accordance with the content; converting voice -recognized information into 

15 metadata; and attaching identification information provided to the content 
for identifying positions in the content to the metadata, thereby associating 
the generated metadata with the positions in the content. 

14. (canceled) 

20 

15. (canceled) 

16. The method for producing metadata according to claim 13, 
wherein time code information that is attached to the content is used 

25 as the identification information. 

17. The metadata production device according to claim 13, 

wherein the content is still -picture content, and the addresses of the 
still-picture content are used as the identification information. 

30 

18. A metadata search device, comprising^ 

a content database that reproduces and outputs content; 

an voice input portion that converts voice signals of entered keywords 
into data with a clock signal that is synchronized with a synchronization 
35 signal of the reproduced content; 

a voice recognition portion that recognizes the kejrwords firom the 
voice signal data that has been converted into data by the voice input 



portion; 

a file processing portion that produces a metadata file by combining 
the keywords that are output fi-om the voice recognition portion with time 
codes that indicate a time position of an image signal that is included in the 
5 content; 

a content information file processing portion that generates a control 
file controlUng a relation between the metadata file and recording positions 
of the content file; 

a recording portion that records the content file, the metadata file 
10 and the control file; and 

a search portion that extracts a recording position corresponding to a 
keyword in the content file by specifying the metadata files in which an 
entered search keyword is included, and referencing the control file; 

wherein the recording position of the content file is the recording 
15 position in the recording portion. 

19. The metadata search device according to claim 18, 

wherein the control file that is output from the content information 
file processing portion is devised as a table that Hsts recording positions of 
20 content in the recording portion in accordance with a recording time of the 
content, and the recording position of the content can be searched fi-om the 
time code. 

20. The metadata search device according to claim 18, 

25 further comprising a dictionary database, and a keyword supply 

portion that supplies keywords related to the content into the voice 

recognition portion; 

wherein the voice recognition portion can select a dictionary of a 

genre corresponding to the content fi'om a pluraHty of genre "dependent 
30 dictionaries, and the voice recognition portion is configxired to recognize 

those keywords with higher priority. 

21. The metadata search device according to claim 18, 
further comprising a dictionary database; 

35 wherein the voice recognition portion can select a dictionary of a 

genre corresponding to the content firom a plurality of genre-dependent 
dictionaries; and 




22 



wherein the search portion is configured to search by keywords that 
are chosen from a common dictionary used by the voice recognition portion. 



